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Title: A method for in vivo production of a mutant, library in 
cells 

5 FIELD OF THE INVENTION 

The present invention relates to methods for in vivo production 
of libraries of polypeptide variants, the screening of these 
variants and selection of those exhibiting desired properties. 
10 The invention furthermore relates to methods for producing the 
desired polypeptide variants. 

BACKGROUND OF THE INVENTION 

15 An increasing 'nurnber of polypeptides, including enzymes and non- 
enzym.atic proteins, are being produced industrially, for use. in 
various industries, household, food/feed, cosmetics, medicine 
etc. One of the major sources for these proteins is and have been 
microorganism found in nature. 

20 

The classical approach for finding polypeptides with new and 
special properties, have been to screen wild type organisms 
present in nature. This has been a very successful way of 
procuring polypeptides to be used in such diverse areas as the 
2 5 above mentioned applications. 

Hov;ever, often it has not been possible to produce such 
pol^/peptides in sufficient amounts because the quantities 
produced in the natural host systems were too minute to allow a 
30 production, and even if the cost was no problem, difficulties 
could be encountered in providing sufficient amounts in relation 
to the demand (e.g. human growth hormone) . 

Such problems have to a large degree been overcome by the advent 
35 of recombinant techniques for the production of polypeptides. In 
this art polypeptides are produced by the use of biological 
systems. Genes encoding certain polypeptides are cloned and 
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transferred into cells chat will produce the polypeptides in 
quantities much larger than those, wherein they are produced in 
the original organism. Over the latest twenty years a large 
number of methods for the production of polypeptides according to 
5 such techniques have been developed. 

Often, proteins from natural sources do not meet the requirements 
for certain applications, and it will be necessary to modify 
existing proteins towards certain activities or biophysical 
10 properties. 

It is possible to generate new variants of a protein by classical 
mutagenesis of the microorganisn^. using radiation (X-ray and UV) 
or chemical mutagens. However, since this approach is a very* 
15 labour and "time consuming process, in the same last two decades 
researchers have been deveicping improvements on existing 
polypeptides by using more specific and selective recom±)inant 
techniques, such as protein ar,z penetic engineering for creating 
artificial diversity. 

2 0 

Based upon consideracio."s usir.p knowledge of the structure- 
function relationships and general protein chemistry, researchers 
have come a long v/ay ir. designing polypeptide variants exhibiting 
i"orovements in various properties . 

2 5 

Hov/ever, it has also beer, realised chat the various interactions 
into which pol\^eptides take pare, are so complex that rational 
desir.:; according to such knowledge has serious limitations, and 
in recent years methods employing random mutagenesis followed by 
30 screening of or selection fro" very large numbers of variants 
produced therefrom^ has gained in::erest. 



For this purpose a microbial 
subsequent expression and 
35 possessing the desired proper 
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Over the years many both in vitro and in vivo DNA mutagenesis 
techniques for creating high numbers of different variants of 
polypeptides have been developed. 

5 Considering the fact that a typical naturally occurring 
polypeptide consists of between 100 and 1000 amino acids, and 
each may be varied in 20 ways (only to stay within the naturally 
occurring amino acids) , the number of possible variants for a 
specific polypeptide is enormous. Since the main parameter that 
10 defines or measures the usefulness of a microbial collection or 
library used to identify improved variants of polypeptide is the 
number of different variants, N, which is comprised in the 
collection, a need for large libraries has emerged. 

15 Especially in cases when a powerful selection systerr, is 
available, the limiting factor for the identification of the 
desired polypeptide is the size of tihe library. 

In in vitro systems the practical, scate of art, limit for N is 
20 about 10\ This is mainly due to inefficiency of transformation 
(introduction of DNA into the cell) of the manipulated DNA into' 
the host organism. This nunnbsr varies a lot from organism to 
organism: in the presently best case, E, coli, the usual 
efficiency of transformation of in vitro manipulated DNA, e.q. a 
:5 ligation of DNA fra-grr.ents or chemical treatment of DN.^v, leads at 
the most to library sizes up to 10^ bacteria {Greg Winter, 
Curre.-t methods in Iirju^Jinology 5: 253-255, 1993). Ver*/ few 
examples of libraries of this size have been reported, 

30 In vitro library constructions in other prokaryotes, such as 
Bacillus sp., Streptococcus sp, or Staphylococcus sp. will for 
practical reasons be orders of magnitude belov; this number. 

Considering eukaryotic hosts such as Saccharomyces cerevisiae or 
35 various Aspergillus sp. , an even lov;er number of transf ormants 
can be expected from in vitro manipulated DNA. 
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A special case of a large library has been reported based on in 
vivo recombination between libraries of antibody light and heavy 
chains based on a specially designed system useful for that 
5 particular case (Griffiths, A.D. et aJ . , 1994, EMBO J. 14: 3245- 
3260) . 

A number of methods are available to generate variants of a 
polypeptide in microorganisms in vivo, ranging from very simple, 

10 such as treating cells with chemical or physical mutagens, to 
rather complex, relying on cells that contain an error-prone DNA 
polymerase but lack the mismatch repair system which corrects the 
errors (Stratagene, XLl-red (nutS, wutD, mutT) Catalog #200129) . 
But these techniques have a major drawback as the mutagenesis is 

15 not targeted to a specific part of the genome (coding for the 
polypeptide of interest) and high frequencies of mutations are 
generated also in essential genes for the cell as well as ' in the 
target gene, resulting in r.assive cell death, together with a 
high number of cells, where zhe rr.utations do not influence the 

20 polypeptide of interest. Such "noise" will limit the accumulation 
of mutations in the target region. 

It is therefore the objec. of the invention to provide an in vivo 
target region- specific mutagenesis procedure in order to produce 
25 very large nurrih^ers, of pz-l-.peptide variants. 

A second object of the invention relates to the screening or 
selection of variants v;ith the desired properties, both bv 
existing and future technologies. 

30 

SUi-n^L^RY OF THE I^4"^S^?TI0^' 

The present invention therefore relates to a method for in vivo 
35 production of a library in cells comprising a multitude of 
mutated genetic elements, wherein an error-prone polymerase is 
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used in ' each ancestral cell to replicate all or a part of a 
genetic element comprising 

i) an origin of replication from which replication is 

initiated, 

5 ii) optionally a genetic marker, e.g. a gene conferring 
resistance towards an antibiotic, 
iii) a gene encoding the polypeptide of interest, 
independently of the host chromosomal replication machinery. 

0 The invention furthermore relates to a method for the generation 
of a DNA sequence encoding a desired variant of a polypeptide of 
interest, wherein 

i) a mutant library is produced by the above method, 

ii) said library is cultivated under conditions conducive for 
5 the .expression of said gene of interest to produce 

polypeptide variants , 

iii) said variant polypeptides are screened or selected for a 
desired property, and hosts producing such desired 
variants identified and is:::lated, 

iv) said genetic element in said hosts is sequenced to 
elucidate the DNA seq^uence of the mutant gene encoding a 
desired variant . 

and a method for the de terr.inat ion of the DNA sequence encoding a 
desired variant of a polvpeptiiGe of interest, wherein 

(i) a "utant library is produced by the above method, 

(ii) said library' is cultivated under conditions conducive for 
the expression of said gene of interest to produce variant 
polypeptides ; 

(iii) said variant polypeptides are screened or selected for a 
desired property, and hosts producing desired variants 
identified and isolated, 

(iv) said genetic element in said hosts is sequenced to 
elucidate the DNA sequence of the m.utant gene encoding a 
desired variant. 
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The screening of the library or the selection of the variants 
depends on the specific polypeptide and which properties thereof 
it is desired to improve and/or retain. It is therefore necessary 
to set up a screening protocol for each case. Such protocols 
5 involving a number of assays are described in the literature 
(Clackson et al . , Nature 352:624-628, 1991, Bryan, ? et aJ . , 
Proteins 1:326-334, 1986). 

An elegant approach to the combination of the generation of 
10 diversity and the selection of variants with the desired 
properties would be a combination of the in vivo method of the 
invention for generating the diversity with a phage display 
system (Greg Winter, Supra) , 

15 A specific example of a polypeptide of interest is the alkaline 
proteases used in the detergent industry for the removal of 
proteinaceous stains from fabric. In that case the screening may 
be performed in actual detergent compositions to investigate 
properties such as thenr.al stability, oxidation stability, 

20 storage stability, substrate specificity and. affinity, stability 
to non- aqueous solvents, pH profile, ionic strength dependence, 
catalytic efficiency, and wash per fcrrria nee . 

Furthermore the invention relates tc a process for the production 
2 5 of a desired polypeptide '/ariant, v;herein 

(i) a DNA sequence encoding a polypeptide of interest that has 
been determined according to the method above is intro- 
duced into a suitable host in a manner whereby it can be 
expressed in said host, 
30 (ii) said host is cultivated under conditions conducive to the 
expression of said DNA sec^jence, and 
(iii) said polypeptide variant is recovered. 

Methods for the introduction of the DNA sequence selected into 
35 suitable host systems are described in, (Sambrook et al. 
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Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Lab., 
Cold Spring Harbor, NYJ. 

It is also within the abilities of the skilled person to select 
5 suitable growth media and other conditions for the host system 
selected that are conducive for the expression of the polypeptide 
variant od interest. Guidance hereto may f. ex. be found in 
(Sambrook et ai . , supra), 

10 Also for the recovery of the polypeptide a large number of 
methods are available for the separation and purification of 
proteins," e.gr. in (Scopes, R.K., protein Purification (1987), 
Springer -Ver lag) 

15 Lastly, the invention relates to the polypeptides produced by the 
above method . 

DETAILED DSSCRIPTIOM OF THE I>r;E:mON 

20 

The invention comprises a rr;ethod to construct in vivo libraries 
of variants in a gene of interest . The method involves the use of 
a genetic element, such as a bacteriophage or a plasmid that is 
able to replicate independently ' of the host chromosomal 

25 replication syste". By the use of the possibility to separate the 
replication of the host chromosome and "the replication of the 
genetic element (phage/plasmid) , it is possible by modifications 
of one replication system to selectively introduce mutations in 
the genetic element (phage/plasmid) keeping the chromosome of the 

30 host intact. This means that the generation of variation in the 
gene of interest does not compromise the viability of the host. 

DNA replication is a highly accurate process , and the 
misincorporation rate of the chrcmosomal replication in E. coli 
35 has been estimated to be in the order of 10"^^ pr. base pr. round 
of replication. The base pairing carried out during Dl^A 
replication leads to a preference for the polymerase to 
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incorporate the correct base at a certain position, which 
accounts for approximately 10^ of the overall replication 
fidelity. If an incorrect base has been incorporated, the 
polymerase will stall and a 3 '-5' exonuclease will often remove 
5 the 3' misincorporated base. This part denoted proof-reading 
accounts for approximately lO' of the overall replication 
fidelity. The repair system of the cell accounts for the last 10^ 
of the fidelity rate. 

10 Replication of the chromosome in E. coli is for the most part 
carried out by DNA polymerase III holoenzyme, which is a multi- 
protein complex containing 10 different polypeptides including a 
polymerase (alpha sub-unit, poiC gene) and a 3 '-5' exonuclease 

(dnaO gene) . 

i 5 

A further polymerase, DNA pol>'::^.era3e I (DNA pol I, polA gene), 
contains three different activities, viz. a DNA polymerase 
activity, a 3 '-5' exonuclease a-tivicy, and a 5 '-3' exonuclease 
activity despite the fact that it is one single polypeptide. This 
20 polymerase has several functions in the cell. Besides DNA repair, 
DNA pol I is also needed for the chromosomal DNA replication, as 
it is involved in the asserJoly of DNA fragments during synthesis 
of the lagging DNA strand. Hov;ever, it replicates only a very 
"inor portion of the genor.e . 

This DolvTTierase is f urtherrr.ore involved in initiation of DNA 
replication of certain classes of plasmids, e.g. ColEI origin of 
reolication-based plasmids such as p3R322 in Escherichia coli and 
Gram-negative bacteria or pAMSl like plasmids in Gram-positive 

30 bacteria. Such plasmids may be able to replicate completely 
through the activity of DNA polym.erase I without DNA polymerase 
III being present in active form, e.g. if this enzyme is 
dysfunctional due to genetic causes, e.g. temperature sensitive 
variants at a non-permissive temperature) , or under conditions 

35 where only limited amounts of DNA Polymerase III are present in 
the cell. 
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It is well known that certain mutations lead to a decrease (or an 
increase) in the fidelity of DNA replication. These mutations 
have been mapped to reside mainly in polymerases, exonucleases or 
5 in elements of the repair system. Unfortunately, most of these 
mutations alter/impair the fidelity rate for the complete genome 
present, and such non- targeted mutations are not desirable. 

One example of such a mutation could be an inactivation of the 
10 3 '-5' exonuclease activity of DNA pol I, 

However, according to the invention use is being made of the fact 
that some elements in the replication system may be terr.porarily 
"switched" off, fully or partially, thereby stopping or greatly 
15 slowing dom the replication of the genome, while replication of 
certain genetic elements as defined herein is continued. 

An E. coll strain containing a terr.cerature sensitive DKA pol III 
(i.e. the polymerase a-sub-unit or another temperature sensitive 

20 sub-unit ■ that render the holoenzyme conditionally non- 
functional) , or a function required for initiation of chrorr.osomal 
replication, such as Dna.^, an error prone DNA pol I and a colEI 
based plasrr.ia containing a gene of interest, is an example of a 
genetic system according tc the invention designed to specif i - 

2 5 caily introduce "utaticns m the plasriid (and the gene of 
interest) . 

In such a system raising the temperature to a non-permissive 
value will have the effect that DNA pol III cer.ses to function 
30 fully, while the error prone DNA pol I will retain its function 
and replicates the plasm.id v;ith reduced fidelity resulting in 
mutated copies of the plasmid. 

Since the generation of mutations is random., each cell will 
35 generate unique m.utations and upon lov/ering the temperature, the 
temperature sensitive function v;ill become active again, and 
normal replication of the cells continue. 
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A variation' would be an E. coli strain with temperature sensitive 
alleles of polIII and poll and an inducible expression (by a ts 
repressor (temperature-sensitive) or by chemical induction) of 
the error-prone polymerase. At a restrictive temperature and the 
presence of the inducer mutations will accumulate in the genetic 
element. At permissive temperature and the absence of the 
inducer, the complete systems functions as the wild type cell. 

Accordingly the invention in its first aspect relates to a method 
for in vivo production of a mutant library in cells comprising a 
multitude of mutated genetic elements, wherein an error-prone 
polymerase is used in each ancestral cell to replicate all or a 
part of a genetic element comprising 

i) an origin of replication from which replication is 
initiated, 

ii) optionally a genetic marker, e.g. a gene conferring 
resistance towards ar. antibiotic, 

iii) a gene encoding the polv-pepcide of interest, 
independently of the host chrorr.osonal replication mtachinery. 

The invention conse^raent ly comprises a method for in vivo 

production of a library in cells comprising a multitude of 

mutated genetic elements corr.prising 

A) providing a cell having 

i) an error-prone polym.erase that independently of the 

chromosomal replication miachinery of said cell will 
replicate all or a part of a genetic element 
comprising 

a) an origin of replication from which replication 
is initiated, 

b) optionally a genetic marker, e.g. a gene 
conferring resistance towards an antibiotic, 

c) a gene encoding the polypeptide of interest, 
and 
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ii) a chromosomal replication machinery that can be 
reversibly induced to be substantially non- 
functional, 

B) growing such a cell under conditions conducive to its 
replication to obtain a multitude of ancestral cells, 

C) reversibly inducing said chromosomal replication machinery 
in said ancestral cells to be substantially non-functional 
for a period of time sufficient to allow for the 
replication of said genetic element by said error-prone 
polymerase to generate mutations in said genetic element, 

D) reversibly inducing said chromosomal replication machinery 
in such mutated cells to be substantially functional, and 

E) growing such mutated cells 'under conditions conducive to 
their replication . 

In this context the expression "mucant library" means a set -of 
cells, bacteria or phages {typically 10^ to 10^^ cells or phages) 
that differs with respect to one particular gene encoding a 
polypeotide of interest. Typically one would like to introduce 
one or more different amino acid alterations in this particular 
polypeptide in each merriber of the library. 

In this context: the expression "error-prone polymerase" means a 
polyiTierase that during DNA replication will incorporate mistakes 
(one of the v;ronc nucieccides in a given position or cause a 
deletion or an insertion of one cr several nucleotides) with 
hiaher frequency than the polyr.erase normally used for this 
purpose (e.g. E. coli DNA pol I, Bacillus subtilis DNA pol I, T4 
DNA polymerase, T7 DNA polyr.erase) . 

The expression "ancestral cell" here means such cells wherein no 
mutations have been introduced. In some embodiments of the 
invention the m.utation cycle nay be reiterated, and in that case, 
such cells that were initially r.utated become ancestral cells for 
the second miUtation cycle, etc. 
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In this context the expression "host chromosomal replication 
machinery" means the DNA polymerase or DNA polymerase holoenzyme 
that is mainly responsible for the replication of the host 
chromosome, e.g. DNA polymerase III in E. coli. 

5 

In this context the expression "genetic element" means a small 
(from 1 or 2 kilo bases to 100 kilo bases) entity consisting of 
RNA or DNA, that is able to replicate independently, i.e. it 
contains an origin of replication. The genetic element would 
10 typically be a bacteriophage, a phagemid, or a plasmid. The 
genetic element must also according to the invention comprise a 
gene encoding the polypeptide of^ interest, and it may further 
comprise a genetic marker, e.g. a gene conferring resistance 
towards an antibiotic. 

15 

A virus, a retrovirus, or a trar.sposon that is able to replicate 
independe.ntly of the host replication machinery, e.g. 
retrotransposons could also be usei as the "genetic element" . 

20 Kim and Loeb (1995, PNAS 92: 6 5-':-5S3) have demonstrated that HIV 
reverse transcriptase (HIV-RT; is able to complem.ent E, coli DNA 
pol I with respect to chrcr.osor.al D:-:a replication and initiation 
of plasrr.id DNA repl icat ior. . The r.isincorporat ion rate of KIV-RT 
(and related retroviral reverse transcriptases) is several orders 

25 of rtacninude higher than ine rate of DNA pol I, i.e. 10*^ to 10*^ 
nisincorporat ions pr . base pr . round of replication. The use of 
such a polymerase in stead of a mutated error prone ■ f:. coli DNA 
pol I in an embodiment of the invention would significantly 
increase the frequency of replication errors in the system 

30 described above. 

In a further embodiment of the invention the mutation cycle 
described above can be reiterated, i.e. the mutagenic polymerase 
switched on and off several tirr.es, thereby generating even more 
35 mutants. Such a step could furthermore help the segregation of 
plasmids if a multicopy plasmid is used as the genetic element. 
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In certain genetic elements one can envision that only the part 
of the genetic element located in the vicinity of the origin of 
replication is replicated by the error-prone polymerase. In such 
cases, the gene of interest should be situated within this 
region. 

In a specific enabodiment of the invention the genetic element is 
a phage, wherein the gene encoding the polypeptide of interest is 
positioned at a locus where the polypeptide upon expression is 
displayed from the surface of the phage, whereby a screening can 
be performed directly (see Greg Winther, supra) . To ensure the 
correspondence between DNA seq^aence of the phage and the protein 
displayed the primary phage stock should be passed through wild 
type E, coli, at low m.ult iplicicy of infection, prior to 
selection or screening. 

To further increase the freq^aenc-/ of r.utationS; the method of the 
invention comprises eribodimenus v;here the method is used in 
conjunction with a repair deficient host, e.g. mutL, mutS, wutH,., 
or a combination of mutator genome types. 

In this context the expression "repair deficient host" means a 
cell containing one or r.ore alterations in genes encoding 
proteins kno*^ to be directly or indirectly involved in the DNA 
repair. The result of such nutations is that a higher frequency 
of introduced mutations (by the pol%^.erases , chemicals, X-ray, UV 
light, etc.) v;ill not be repaired_ and will be "permanently" 
incorporated in the genome, the so called mutator phenotype. 
ExamiOles of such genes are nutL, mutS, mutH, mutT. 

As the genetic element one could as indicated above use a 
phagem.id in stead of a plasmid in order to couple the variant 
generation to a display system, e.g. M13, fl, fd. 

In this context the expression "phagemid" means a plasmid that 
besides its plasmid origin of replication contains a phage origin 
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of replication, phagemids are dependent of the conditions able t 
replicate as a plasmid or as a phage (upon infection with . 
helper-phage) . 

5 A phagemid based system would involve the construction of i 
phagemid containing: 

1) a plasmid origin of replication, e.g. ColEI 

2) a M13 phage origin of replication 
a chimeric gene consisting of the gene of interest fused 
to the gene encoding GUI protein. (0rum, P. et al . , Nucl . 
Acid. Res, 21: 4491-4498, 1993). 



3) 

10 



The first step would be the generation of diversity by 
growing/maintaining an E, coll strain transformed with this 
15 phagemid as' described above. The second step would be the 
infection with the helper phage in order to create single 
stranded phagemid that will be packed into phage particles. The 
phages displaying the variant proteins can then be subjected to a 
selection procedure . 

20 

Also, certain bacteriophages such as T4 or T7 in Eschericia or 
SPOII and ?hi29 in 3acillus contain their O'^t^. DNA polymerases, 
ana according to the invention one could envision embodiments 
where the genetic element described above is a bacteriophage 
5 containing an error-prone D::a polvrr.erase . 

According to the invention the error-prone polym.erase is 
typically selected from the group comprising DNA polym.erase I or 
reverse transcriptases. A preferred error-prone polymerase is a 
0 variant of E. coli DNA polymerase I or HIV reverse transcriptase. 

As a polypeptide of interest a large number is possible, and 
especially such polypeptides exhibiting biological activities 
could^be mentioned. Am.ong these are enzym.es, hormiones, receptors, 
5 blood-clotting factors, anti-microbial agents, and other such 
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polypeptides important for the prophylaxis and treatment of 
various disorders and diseases in humans and animals. 

Also, enzymes used for industrial purposes could be mentioned. 
5 Among such industrial enzymes, enzymes belonging to the groups 
carbonyl hydrolases, carbohydrases, oxidoreductases, trans- 
ferases, phytases, ant i-microbial polypeptides, oxidoreductases, 
isomerases, lyases, and ligases. 

10 In this context the expression "carbonyl hydrolase" means enzymes 
that hydrolyze compounds containing a -C(=0)-X group, where X is 
oxygen or nitrogen. 

Specific classes of enzymes belonging to the group of carbonyl 
15 hydrolases are such as hydrolases (lipases) and peptide 
hydrolases (proteases) . 

Proteases are here meant: as enzvTr.es classified under the Enzyme 
Classification number E.G. 3.4 in accordance with the^ 
20 Recommendations (19S2) of the International Union of Biochemistry" 
and Molecular Biology (lUBMB) . 

Examples include proteases selected from those classified under 
the Enzym.e Classification (E.G.) numbers: 

25 

3.4.11 (i.e. so-called a"inopeptidases) , including 3.4.11.5 
(Prolyl aminopeptidase) , 3.4.11.S (X-pro aminopeptidase) , 
3.4,11.10 (Bacterial leucyl aT:inopeptidase) , 3.4.11.12 
(Thermophilic aminopeptidase), 3.4.11.15 (Lysyl aminopeptidase), 
30 3.4.11.17 (Tryptophanyl aminopeptidase), 3.4.11.18 (Methionyl 
aminopeptidase) . 

3.4.21 (i.e. so-called serine endopeptidases) , including 3.4.21.1 
(Chymotrypsin) , 3.4.21.4 (Tr%psin) , 3.4,21.25. (Cucumisin) , 
35 3.4.21.32 (Brachyurin) , 3.4.21.48 (Cerevisin) and 3.4.21.62 
(Subtilisin) ; 
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3.4.22 (i.e. so-called cysteine endopeptidases) , including 
3.4.22.2 (Papain), 3.4.22.3 (Ficain) , 3.4.22.6 (Chymopapain), 
3.4.22.7 (Asclepain) , 3.4.22.14 (Actinidain) , 3.4.22.30 

5 (Caricain) and 3.4,22,31 (Ananain) / 

3.4.23 (i.e. so-called aspartic endopeptidases) , including 
3.4.23.1 (Pepsin A), 3.4.23.18 (Aspergillopepsin I), 3.4.23.20 
(Penicillopepsin) and 3.4,23.25 (Saccharopepsin) ; and 

10 

3.4.24 (i.e. so-called metallo endopeptidases), including 
3.4.24.28 (Bacillolysin) . 

Examples of relevant subtilisins comprise subtilisin BPN', 
15 subtilisin amylosacchari ticus , subtilisin 168, subtilisin 
mesencericopeptidase, subtilisin Carlsberg, subtilisin DY, 
subtilisin 309, subtilisin 147, thermitase, aqualysin, Bacillus 
PB92 protease, proteinase :<, Protease T\77 , and Protease TV73 . 

20 Specific examples of such readily available commercial proteases 
include Esperase®, Alcalase©, Neucrase®, Dyrazym©, Savinase®, 
Pyrase®, Pancreatic Tr^^T)sin NOVO (PTN), Bio-Feed'" Pro, Clear- 
Lens Pro (all enzymes available from Novo Nordisk A/S) 

25 Examples of other comm.ercial proteases include Maxatase®, 

Maxacal®, Maxapem© marketed by Gist -Brocades N.V., Opticlean© 

marketed by Solvay et Cie. and Purafect® marketed by Genencor 
International . 



30 It is to be understood that also protease variants are 
contemplated as the polypeptide of interest. Examples of such 
protease variants are disclosed in EP 130.756 (Genentech) , EP 
214.435 (Henkel) , WO 87/04461 (Amgen) , VIO 87/05050 (Genex) , EP 
251.446 (Genencor), EP 260.105 (Genencor), Thomas et al . , (1985), 

35 Nature. 318, p. 375-376, Thomas et al., (1987), J. Mol. Biol., 
193, pp. 803-813, Russel et al . , (1987), Nature, 328, p. 496-500, 
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WO 88/08028 (Genex) , WO 88/08033 (Amgen) , WO 89/06279 (Novo 
Nordisk A/S) , WO 91/00345 (Novo Nordisk A/S) , EP 525 610 
(Solvay) and WO 94/02618 (Gist-Brocades N.V.). 

5 The activity of proteases can be determined as described in 
"Methods of Enzymatic Analysis", third edition, 1984, Verlag 
Chemie, Weinheim, vol. 5. 

Lipases are here meant as enzymes classified under the Enzyme 
10 Classification number E.G. 3.1.1 (Carboxylic Ester Hydrolases) in 
accordance with the Recommendations (1992) of the International 
Union of Biochemistry and Molecular Biology (lUBMB) . 

Examples include lipases selected from those classified under the 
15 Enzyme Classification (E.G.) nur±)ers : 

3.1.1 (i.e. so-called Carbox\'lic Ester Hydrolases), including 
(3.1.1.3) Triacylglycerol lipases, (2.1.1.4.) Phospholipase A^^ 

20 Examples of lipases include lipases derived from the following 
microorganisms. The indicated patent publications are in- 
corporated herein by reference: 

Hvmicola, e.g. H. brevispora, H. lanuginosa^ H. brevis var. 

therr.oidea and H. ir.soler.s (US 4,810,414) 

25 

Pseudo-onas, e.g. Ps . fragi, Ps . stutzeri, Ps . cepacia and Ps. 
fluorescens (v;0 89/04361), or Ps . plantarii or Ps . gladioli (US 
patent no. 4,950,417 (Solvay enzymes)) or Ps. alcaligenes and Ps. 
pseudoalcaligenes (E? 218 272) or Ps. ir.endocina {VIO 88/09367 ; US 
30 5,389, 536) . 

Fusariujn, e.g. F. oxysporuin (E? 130,064) or F. solani pisi (WO 
90/09446) . 

35 Mucor (also called Rhizomucor) , e.g. M. miehei (EP 238 023). 
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Chromobacteriim (especially C. viscosum) 

Aspergillus (especially A. niger) . 

5 Candida, e.g. c. cylindracea (also called C. rugosa) or C. 
antarctica (WO 88/02775) or C. antarctica lipase A or B (WO 
94/01541 and WO 89/02916) . 

GeotricuiD, e.g. G. candidum (Schimada et al . , (1989), J. 
10 Biochem,, 106, 383-388) 

Penicilliiun, e.g. P. cajnsmbertii (Yamaguchi et al., (1991), Gene 
103, 61-67) . 

15 Rhizopus, e.g. R. deJemar (Hass et; al . , (1991), Gene 109, 107- 
113) or R. niveus (Kugimiya er al., (1992) Biosci . Siotech. 
Bioche.Ti 56, 716-719) or R. oryzae . 

Bacillus, e.g. B, suhtilis (Dartois et al . , (1993) Bioche.Tiica et 
20 Biophysica acta 1131, 253-26C; or S. s tearothermophilus (J? 
64/7744992) or 3. pui:)ilus (WQ 91/16422). 

Specific exarr.ples of readily available ccrr.mercial lipases include 
Lipolase®, Lipolase''' Uitira, Liposyir.e©, Palatase®, Novozyrri© 435, 
25 Lecitase® (all available from Novo Mordisk k/S) . 

Examples of other lipases are Lumafast^'\ Ps . mendocina lipase 

from Genencor Int. Inc.; Lipomax'"^, Ps . pseudoalcaligenes lipase 

from Gist Brocades/Genencor Int. Inc.; Fusarium solani lipase 

30 (cutinase) from Unilever; Bacillus sp. lipase from Solvay 
enzymes. Other lipases are available from other companies. 
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The' accivity of the lipase can be determined as described in 
"Methods of Enzymatic Analysis", Third Edition, 1984, Verlag 
Chemie, Weinhein, vol. 4, or as described in AF 95/5 GB {avail- 
5 able on request from Novo Nordisk A/S) . 

In this context the expression "carbohydrase" means all enzymes 
capable of breaking down carbohydrate chains (e.g. starches) of 
especially five and six member ring structures (i.e. enzymes 

10 classified under the Enzyme Classification number E.G. 3.2 
(glycosidases) in accordance with the Recommendations (1992) of 
the International Union of Biochemistry and Molecular Biology 
(lUBMB) ) . Also included in the group of carbohydrases according 
to the invention are enzymes capable of isomerizing carbohydrates 

15 e.g. six member ring structures, such as D-glucose to e.g. five 
member ring str^actures like D-fr^actose. 

Examples include carbohydrases selected from those classified 
under the Enzyme Classification (E.G.) numbers: 

20 

a-amylase (3.2.1.1) p-aT.ylase (3.2.1.2), glucan 1,4-a- 

glucosidase (3.2.1.3), ceiiulase (3.2.1.4), endo-l,3(4)- p- 
glucanase (3.2.1.6), endo- 1 , 4 -^-xylanase (3.2.1.8), dextranase 

(3.2.1,11), chitinase (3,2.1.14), polygalacturonase (3.2.1.15), 
25 lysozyir.e (3.2,1,17), P - clucosidase ;3.2,1.21), a-galactosidase 

(3,2.1.22) , P-galacrosidase (3.2.1.23) , amylo-1 , 6 -glucosidase 

(3.2.1.33), xylan 1 , 4 - -xylosidase (3.2.1.37), glucan endo-l,3-p- 
D-glucosidase (3.2.1.39) , a-dextrin endo-1 , 6-glucosidase 

(3.2.1.41), sucrose a-glucosicase (3.2.1.48), glucan endo-l,3-a- 
30 glucosidase (3.2.1.59), glucan 1 , 4 -P-glucosidase (3.2.1.74), 
glucan endo- 1 , 6 -p-glucosidase (3.2.1.75), arabinan endo-1, 5-a- 

arabinosidase (3.2.1.99), lactase (3.2.1.108), chitonanase 

(3.2.1.132) and xylose isomerase (5.3.1.5). 

35 Examples of relevant carbohydrases include a-1 , 3 -glucanases 
derived from Trichoderma harzianim; a-1, 6-glucanases derived from 
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arabinosidase (3.2.1.99), lactase (3.2.1.108), chitonanase 
(3.2.1.132) and xylose isomerase (5.3.1.5). 

Examples of relevant carbohydrases include a-1 , 3 -glucanases 
5 derived from Trichoderma harzxanum; a-1 , 6-glucanases derived from 
a strain of Paecilomyces ; P-glucanases derived from Bacillus 
subtilis; p-glucanases derived from Humicola insolens; P- 
glucanases derived from Aspergillus niger; p-glucanases derived 
from a strain of Trichoderma; P-glucanases derived from a strain 

10 of Oerskovia xanthineolytica; exo- 1 , 4 -a-D-glucosidases (gluco- 
amylases) derived from Aspergillus' niger ; a-amylases derived from 
Bacillus subtilis; a-amylases derived from Bacillus 
amyloliquefaciens; a-amylases derived from Bacillus 

stearothermophilus ; a-amylases derived from Aspergillus oryzae; 

15 a-amylases derived from non-pathogenic microorganisms; a- 
galactosidases derived from Aspergillus niger; Pentosanases, 
xylanases, cellobiases, cellulases, hemi-cellulases derived from 
Humicola insolens; cellulases derived from Trichoderma reesei; 
cellulases derived from non-pathogenic mold; pectinases, 

20 cellulases, arabinases, hemi -celluloses derived from Aspergillus 
niger; dextranases derived from Penicillium lilacinum; endo- 
glucanase derived from non-pathogenic moid; pullulanases derived 
from Bacillus acidopullyticus ; P-galactosidases derived from 
Kluyveromyces fragilis ; >nvdanases derived from Trichoderma 

25 reessi; 

Specific examples of readily available commercial carbohydrases 
include Alpha-Gal^'\ Bio-Feed''* Alpha, Bio-Feed'" Beta, Bio-Feed'" 
Plus, Bio-Feed'-'' Plus, Novozymeo 138, Carezyme®, Celluclast®, 
30 Cellusoft®, Ceremyl®, Citrozym''', Denimax'", Dezyroe'", 
Dextrozyme'^, Finizym®, Fungamyl''' , Gamanase'^, Glucanex®, 
Lactozym®, Maltogenase'" , Pentopan'*", Pectinex™, Promozyme®, 
Pulpzyme'^", Novamyl^^, Termamyl®, AMG (Amyloglucosidase Novo), 
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Maltogenase®, Sweetzyme® , Aquazym® (all enzymes available from 
Novo Nordisk A/S) . Other carbohydrases are available from other 
companies . 

5 It is to be understood that also variants of such carbohydrases 
are contemplated as the polypeptide of interest. 

The activity of carbohydrases can be determined as described in 
"Methods of Enzymatic Analysis", third edition, 1984, Verlag 
10 Chemie, Weinheim, vol. 4. 

Oxidoreductases are here meant to be -enzymes classified under the 
Enzyme Classification number E.C. 1 (Oxidoreductases) in 
accordance with the Recommendations (1992) of the International 
15 Union of Biochemistry and Molecular Biology (ItJBMB) . 

Examples include oxidoreductases selected from those classified 
under the Enzyme Classification (E.C.) numbers: 

Glycerol-3 -phosphate dehydrogenase (NAD*) (1 . 1 . 1 . 8) , Glycerol-3- " 

20 phosphate dehydrogenase NAD(P)* (1.1.1.94), Glycerol-3 -phosphate 
1-dehydrogenase (NADP) (l.i.1.94), Glucose oxidase (1.1.3.4), •* 
Hexose oxidase (1.1.3.5), Cacechol oxidase (1.1.3.14), Bilirubin 
oxidase (1.3.3.5), Alanine dehydrogenase (1.4.1.1), Glutamate 
dehydrogenase (1.4.1.2), Glutamate dehydrogenase (NAD(P)*) 

25 (1.4.1.3), Glutamate dehycrogenase (NAD?*) (1.4.1.4), L-Amino 
acid dehydrogenase (1.4.1.5), Serine dehydrogenase (1.4.1.7), 
Valine dehydrogenase (NAD?*) (1.4.1.8), Leucine dehydrogenase 
(1.4.1.°), Glycine dehydrogenase (1.4.1.10), L-Amino-acid oxidase 
(1.4.3.2,), D-Amino-acid oxidase ( 1 . 4 . 3 . 3 ) , L-Glutamate oxidase 

30 (1.4.3.11), Protein-lysine o-oxidase (1.4.3.13), L-lysine oxidase 
(1.4,3.14), L-Aspartate oxidase (1.4.3.16), D-amino-acid 
dehydrogenase (1.4.99.1), Protein disulfide reductase (1.6.4.4), 
Thioredoxin reductase (1.6.4.5), Protein disulfide reductase 
(glutathione) (1.8.4.2), Laccase (i.10.3.2), Catalase (1.11.1.6), 

35 Peroxidase (1.11.1.7), Lipoxygenase (1.13.11.12), Superoxide 
dismutase (1.15.1.1) 
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Said Glucose oxidases may be derived from Aspergillus nigrer. 

Said Laccases may be derived from Polypoxus pinsitus , 
Myceliophtora thermophilBf Coprinus cinereus, Rhizoctonia solani, 
Rhizoctonia praticola, Scytalidium theunophilum and Rhus 
vernicifera. 

Bilirubin oxidases may be derived from Myrothechecium verrucaria. 

The Peroxidase may be derived from e.g. Soy bean, Horseradish or 
Coprinus cinereus . 

The Protein Disulfide reductase may be any mentioned in any of 
the DK patent applications no. 753/93, 265/94 and 264/94 (Novo 
Nordisk A/S) , which are hereby incorporated as reference, inclu- 
ding Protein Disulfide reductases of bovine origin, Protein 
Disulfide reductases derived frorr. Aspergillus oryzae or Asper- 
gillus niger, and DsbA or DsbC derived from Escherichia coli. 

Specific examples of readily available commercial oxidoreductases 
include Gluzyrp.e^*"' (enzvrr.e available from Novo Nordisk A/S) . 
However, other oxidoreductases are available from others. 

It is to be understood that also variants of oxidoreductases are 
conterr.plated as the pol\'P'ept ide of interest. 

The activity of oxidoreductases can be determined as described in 
"Methods of Enzyrriatic Analysis", third edition, 1984, Verlag 
Chemie, V/einheim, vol. 3. 



In this context transferases 
Enzym.e Classification numb>er 
Recommendations (1992) of the 
and Molecular Biology (IUBM3) . 



are enzymes classified under the 
E.C. 2 in accordance with the 
International Union of Biochemistry 
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The transferases may be any transferase in the subgroups of 
transferases: transferases transferring one-carbon groups (E.C. 
2.1); transferases transferring aldehyde or residues (E.C. 2.2); 
acyltransferases (E.C. 2.3); glucosyltransf erases (E.C. 2.4); 
5 transferases transferring alkyl or aryl groups, other that methyl 
groups (E.C. 2.5); transferases transferring nitrogenous groups 
(2.6) . 

In a preferred embodiment the transferease is a transglutaminase 
10 E.C 2.3.2.13 (Protein-glutamine y-glutamyltransferase) . 

Transglutaminases are enzymes capable of catalysing an acyl 
transfer reaction in which a y-carboxyamide group of a peptide- 
bound glutamine residue is the acyl donor. Primary amino groups 

15 in a variety of compounds may function as acyl acceptors with the 
subsequent formation of mono-substituted y-amides of peptide- 
bound glutamic acid. V/hen the epsilon-amino group of a lysine 
residue in a peptide-chain serves as the acyl acceptor, the,, 
transferases form intramolecular or intermolecular y-glutamyl -£-.. , 

20 lysyl crosslinks. 

Examples of transglutaminases are described in the pending DK 
patent application no. 990/94 (Novo Nordisk A/S) . 

25 The transglutaminase may the of human, aminal (e.g. bovine) or 
microbially origin . 

Examples of such transglutaminases are animal derived 
transglutaminases, FXIIIa; microbial transglutaminases derived 

30 from Physarum polycephalum (Klein et al , , Journal of Bacteriol- 
ogy, 174, p. 2599-2605); transglutaminases derived from Strep- 
tomyces sp., including Streptcr.yces lavendulae, Streptomyces 
lydicus (former Streptomyces libani) and Streptoverticillium sp., 
including Streptoverticilliuin mobaraense, Streptoverticillium 

35 cinnajnoneum, and Streptoverticillium griseocameum (Motoki et 
al,, US 5,156,956 ; Andou et al . , US 5,252,469; Kaempfer et al . , 
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Journal of General Microbiology, 137, 1831-1892 ; Ochi et al . , 
International Journal of Sytematic Bacteriology, 44, 285-292; 
Andou et al . , US 5,252,469; Williams et ai . , Journal of General 
Microbiology, 129, 1743-1813). 

5 

It is to be understood that also transferase variants are 
contemplated as the polypeptide of interest. 

The activity of transglutaminases can be determined as described 
10 in "Methods of Enzymatic Analysis", third edition, 1984, Verlag 
Chemie, Weinheim, vol. 1-10. 

In this context phytases are enzymes classified under the Enzym.e 
Classification number E.C. 3.1.3 (Phosphoric Monoescer 
15 Hydrolases) in accordance with the Recommendations (1992) of the 
International Union of Biochenist ry and Molecular Biology 
(lUB^TB) . 

Phytases are enzymes produced by r.icroorganisms which catalyse 
20 the conversion of ohvtate tc inositol and inorganic phosphorus 

Phvtase producing "icroorganisrr.s coriorise bacteria such as 
Bacillus subtilis, Bacillus natco and Pseudomonas ; yeasts such as 
Saccharcnyces carevisiae ; and fungi such as Aspergillus niger, 
25 Aspergillus ficuus::, Aspergillus awa.r,ori, Aspergillus oryzae, 
Asoeraillus terreus or Aspergillus nidulans, and various other 
Aspergillus species) . 

Examples of phytases include phytases selected from those 
30 classified under the Enzyrr.e Classification (E.C.) nurrlDers: 3- 
phytase (3.1.3.8) and o-phytase (3.1.3.26). 

The activity of phytases can be determined as described in 
"Methods of Enzymatic Analysis", third edition, 1984, Verlag 
35 Chemie, Weinheim, vol. 1-10, or may be measured according to the 
method described in EP-Al-0 420 358, Example 2 A. 
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In the present context an ant i -microbial polypeptides may be any 
polypeptide exhibiting ant i -microbial activities, such as anti- 
fungal, anti-bacterial, and/or anti-insecticidal activity. 

5 

Such polypeptides may also exhibit other activities such as 
enzymatic activity. 

Examples of anti-microbial polypeptides according to the 
10 invention include: fungicidally active polypeptides derived from 
the mold genus Curvularia described in WO 94/01459 (Novo Nordisk 
A/S) ; anti-bacterial polypeptides, described in EP 403.458 
(Kabigen AB) ; anti-microbial proteins isolated from the Mirabilis 
seed, described in WO 92/15691 (Imperial Chem Ind. PLC); anti- 
15 bacterial polypeptides isolated from an extract of pig small 
intestine, described in WO 92/22573 (Boman et al . ) ; polypeptide 
with yeast lethal action accumulated by yeast of Hansenula spp. 
as described in JP-60130599; Phycolacca insularis antiviral 
protein, which can be used as an anti-microbial described in US 
20 patent no, 5,348,865 (Jin Ro LTD.); bacteriolytic enzymes 
preparations derived from Nocardiopsis dassonvillei described in 
US patent: no. 5,354,681 (Novo Industiri A/S). 

Examples of other ant i -microbial polypeptides are maganinin, 
25 orotecrin, defensin, pseudomycin , mutanolysin and N- 
acetylmuramidase . 

The present invention in a further aspect relates to a method for 
generating a DNA sequence encoding a desired variant of a 
30 polypeptide of interest; wherein 

(i) a mutant library is produced by the above method, 

(ii) said library is cultivated under conditions conducive for 
the expression of said gene of interest to produce variant 

• polypeptides, 
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(iii) said variant polypeptides are screened or selected for a 
desired property, and hosts producing desired variants 
identified and isolated, 

(iv) the DNA encoding said variants is isolated. 

5 

The present invention in a still further aspect relates to a 
method for the determination of a DNA sequence encoding a desired 
variant of a polypeptide of interest, wherein 
(i) a mutant . library is produced as described above, 
10 (ii) said library is cultivated under conditions conducive for 
the expression of said gene of interest to produce variant 
polypeptides, . 

(iii) said variant polypeptides are screened or selected for a 
desired property, and hosts producing desired variants 

15 identified and isolated, 

(iv) said genetic element in said hosts is sequenced to 
elucidate the DNA sequence of the mutant gene encoding a 
desired variant. 



20 This aspect of the invention can be performed by making dilutions 
of the library (e.g. in nicrotiter plates) and culturing these, 
whereby copulations are made each originating from one member of 
the librar^y, and the varian:: polypeptide produced from each of 
the populations screened for the desired properties. 

25 Alternatively the library might be plated on agar-plates 
containing a desired growth medium that allows for the screening 
of or selection for desired properties of the variant 
polypeptide . 

30 If the phage display m.ethod is used, the screening or selection 
is performed directly with the phages. 

The criteria used for the selection will vary according to the 
end use of the polypeptide variant of interest, but properties 
35 typically being tested may include solubility and half-life in 
various media, antigenicity and allergenicity , thermal stability, 
oxidation stability, storage stability, substrate specificity and 



wo 97/25410 



PCT/DK97/00014 



27 

affinity, stability to non-aqueous solvents, pH profile, ionic 
strength dependence, catalytic efficiency, and compatibility with 
other components of envisaged end products wherein the 
polypeptide variant will form a part. 

5 

For enzymes to be used in detergents further properties to be 
investigated are, wash performance and compatibility with various 
surfaces, especially fabrics. 

10 Numerous other criteria could be mentioned. 

Upon identification of populations that produce variant 
polypeptides fulfilling the criteria selected, the DNA encoding 
the polypeptide variant of interest is isolated and sequenced by 
15 use of methods well known in the art". 

The invention furthermore comprises a process for the production 
of a desired polypeptide variant:, wherein 

(i) a DNA sequence determined as indicated above is introduced 
into a suitable host in a manner whereby it can be. 
expressed in said host, 

(ii) said host is cultivated under conditions conducive to the 
e:cpression of said DNA sequence, and 

(iii) said polypeptide variant is recovered. 

The present invention can be used with any cell, especially any 
microbial cell, but it is often suitable to use a prokaryote, 
especially a bacterium, preferably of the genus Bacillus , etc. 

Among the Bacilli it is preferred to use a strain chosen from the 
group comprising B. lentus, B. licheniformis, B. 
amyloliquefaciens, 3. subtiJis , etc. 



20 



30 



For some uses it is preferable tc use a microbial cell which is a 
35 fungus, especially a filamentous fungus, preferably of the genus 
Aspergillus, Triohodenua, etc. 
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Among the Aspergilli it is preferable to use a strain chosen from 
the group comprising A. oryzae, A, niger, A. awamori, etc. 

5 Among the Trichodeinna it is preferable to use a strain chosen 
from the group comprising T. reseei, etc. 

In yet other situations it is more expedient to use a mammalian 
cell chosen from the group comprising BKK, etc. cells. 

10 

The invention should not be construed to be limited to specific 
examples or embodiments mentioned in the specification above or 
the following examples. 

15 MATERIALS AiND METHODS 
E>Lft-M?LES 

20 

EX.AMPLE 1 

The system used is an Eschericia coli host cell, which is 
characrerized by a r.u~ber of chrcxoscT.ai mutations: 
2S i) a cs (therrr.Dsensir ive) rr;Utatior. in the polC gene (encoding 

DNA polymerase III, being the main replication polymerase) 
ii) a mutation in the polA gene {encoding DNA pol;~p.erase I) 
causing an increased error race by a reduction in the 3*- 
5' exonuclease activity. 
30 iii) repair deficiency by the "utL nutation. 

The target for the in vivo mutiacenesis is plasmid pBR322 (colEl 
origin) having either (i) a frame shift mutation, or (ii) a stop 
codon introduced into the tet gene, encoding a protein conferring 
35 resistance towards tetracyclin. 
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In each case the repair of the mutation leads to a dominant 
tetracycline resistant phenotype. 

pBR322 contains also the bla gene conferring resistance towards 
5 ampicillin. The higher mutagenesis frequency at the target region 
is seen as a higher frequency of tetracycline resistant colonies 
after plating a culture exposed to "mutation-introduction" 
conditions . 

10 An E. coli culture grown at 37°C to an optical density of l 
measured at 600 nm is exposed to 2, 4 or 16 hours at restrictive 
temperature, e.g. 42*'C. At these tim.e points dilution series of 
the cultures are plated on LBagar supplementet with 
1) ampicillin (AmpR colonies) 

15 2) tetracycline and ampicillin, (A:npR and tetR colonies) 

The ratio of tetracycline resistant colonies to ampicillin 

resistant colonies indicate the nurriber of cells in the culture 

that contains one copy of a repaired tet gene, indicated one 
20 specific mutagenesis event. 

This means, if a clone have become tetracycline resistant, at 
least one specific mutation has occurred to repair the originally 
introduced gene defect. 



25 
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PATEI.T CLAIMS 

1. 'A method for in vivo production of a library in cells 
comprising a multitude of mutated genetic elements, wherein an 

5 error-prone polymerase is used in each ancestral cell to 
replicate all or a part of a genetic element comprising 

i) an origin of replication from which replication is 
initiated, 

ii) optionally a genetic marker, e.g. a gene conferring 
10 resistance towards an antibiotic, 

iii) a gene encoding the polypeptide of interest, 
independently of the host chromosomal replication machinery. 

2. A method for in vivo production of a library in cells 
15 comprising a multitude of mutated genetic elements comprising 

A) providing a microbial cell having 

i) an error-prone pol-.—erase that independently of the 
chromosomal replica::icr. r.achinery of said cell will 
replicate all or a part of a genetic element 

20 comprising 

a) an origin of replication from which replication 
is initiated, 

b) optionally a genetic marker, e.g. a gene 
conferring resistance towards an antibiotic, 

25 c) a gene enccding the polypeptide of interest, 

and 

ii) a chromosomal replication machinery that can be 
reversibly induced to be substantially non- 
functional , 

30 B) growing such a cell under conditions conducive to its 

replication to obtain a r.ultitude of ancestral cells, 
C) reversibly inducing said chromosomal replication machinery 

in said ancestral cells tc be substantially non-functional 
, for a period of time sufficient to allow for the 
35 replication of said genetic element by said error-prone 

polymerase to generate mutations in said genetic element. 
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D) reversibly inducing said chromosomal replication machinery 
in such mutated cells to be substantially functional, and 

E) growing such mutated cells under conditions conducive to 
their replication. 

5 

3. The method of claim 1 or 2, wherein said cell further 
comprises a deficient repair system, 

4. The method of any of the claims 1 to 3 , wherein said 
10 genetic element is a plasmid, a phagemid, a phage, a virus, a 

retrovirus or a retrotransposon . 

5. The method of any of the claims 1 to 4 , wherein said cells 
are microbial cells. 

15 

6. The method of any of the claims 1 to 5 , wherein said 
error-prone polymerase is selected from the group comprising DNA 
pol I, DNA pol II, reverse transcriptase and more specifically 
E,coli DNA pol I, Bacillus siibzilis DNA pol I, HIV reverse 

20 transcriptase, T4 DNA polymerase, T7 DNA polymerase, Phi29 DNA 
polymerase , 

7. The method of any of the clair.s 1 to 6, wherein said 
chromosomal replication machiner^^ that can be reversibly induced 

25 to be substantially non- functional is a temperature sensitive E, 
coli DNA polymerase III 

8. A method for the determination of a DNA sequence encoding 
a desired variant of a polypeptide of interest, wherein 

30 (i) a mutant library is produced by the method of any of the 
claims 1 to 7, wherein said genetic element . comprises a 
gene encoding said polypeptide of interest, 
(ii) said library is cultivated under conditions conducive for 
the expression of said gene of interest to produce variant 

35 polypeptides, 
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(iii) said variant polypeptides are screened or selected for a 
desired property, and hosts producing desired variants 
identified and isolated, 

(iv) said genetic element in said hosts is sequenced to 
elucidate the DNA sequence of the mutant gene encoding a 
desired variant. 



9. A method for generating a DNA sequence encoding a desired 

variant of a polypeptide of interest, wherein 
10 (i) a mutant library is produced by the method of any of the 
claims 1 to 1, wherein said genetic element comprises a 
gene encoding said poi\pepciD£ of in L eras t, 

(ii) said library is cultivated under ccr.cir.ions conducive for 
the expression of said ger:e of inrerest to produce variant 

15 polypeptides, 

(iii) said variant polypeptides* rire scrsere-j or selected for a 
desired property, and hoses producir.g desired variants 
identified and isolated, 

(iv) the DNA encoding said variants is isolated, 

20 

10. A process for the production of a desired polypeptide 
variant, wherein 

(i) a DNA sequence obtained according to claim 9 or determined 
according to claim S is introduced into a suitable host in 

-5 a rr.anner whereby it can be expressed in said host, 

(ii) said host is cultivated under conditions conducive to the 
expression of said DNA sequence, and 

(iii) ' said polypeptide variant is recovered. 

30 11. A method of any of the claims 1 co 10, wherein said 
polypeptide of interest is an enzyrr.e. 

12. A method of claim. 11, v/nerein said enzyme is a carbonyl 
hydrolases , carbohydrases , oxidoreductases , transferases , 
35 phytases, ligases, lyases, and anti-microbial polypeptides. 
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13,. A method of claim 12, wherein said carbonyl hydrolase is a 
protease, or a lipase. 

14. A method of claim 12, wherein said carbohydrase is an 
5 amylase, glucosidase, cellulase, glucanase, xylanase, dextranase, 

chi tinase , polygalacturonase , lysozyme , glucosidase , 

galactosidase, xylosidase, arabinosidase, lactase, chitonanase, 
xylose isomerase, pectin esterase, rhamnogalacturonase, endo- 
glucanase . 

10 

15. A method of claim 12, wherein said oxidoreductase is a 
dehydrogenase, oxidase, reductase, La.ccase, Catalase, Peroxidase, 
Lipoxygenase, Superoxide dismutase. 

15 16. A method of claim 12, wherein said transferase is a 
transferase transferring one-carbon groups, a transferase 
transferring aldehyde or residues, acyltransf erase, 

glucosyltransf erase, transferase transferring alkyl or aryl 
groups, other that methyl groups, transferase transferring 

20 nitrogeneous groups. 

17. A method of any. of the claims 1 to 16, wherein said cell 
is a prokaryote, especially a bacteriurp., preferably of the genus 
Bacillus, Escherichia, Staphylococcus , and Streptococus . 

25 

18. A method of claim 17, wherein said Escherichia is a strain 
chosen from E, coli. 

19. A m.ethod of claim 17, wherein said Bacillus is a strain 
30 chosen from the group comprising B. lentus, B. lichenifonuis, B. 

amyloliquefaciens, B, subtilis, 

20. A method of any of the claims 1 to 16, wherein said cell 
is a • fungus, especially a yeast or a filamentous fungus, 

3 5 preferably of the genus Aspergillus, Trichoderma, 
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21. * A method of claim 20, wherein said Aspergillus is chosen 
from the group comprising A. oryzae, A, niger, A, awamori, 

22. A method of claim 20, wherein said Trichoderma is chosen 
from the group comprising T. reseei . 

23. A method of any of the claims 1 to 16, wherein said cell 
is a mammalian cell chosen from the group comprising BHK cells or 
an insect cell. 
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